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Abstract 

In  all  applications  of  term  rewriting  systems,  computing  the  normal  forms  of 
terni.s  is  the  fundamental  step.  In  tiiis  paper,  we  propose  a  directed  acyclic 
graph  (dag)  representation  for  term  and  term  rewriting.  Tlie  rewriting  on  dags 
is  much  more  efficient  tlian  the  rewriting  on  trees.  \\'<'  design  se\eral  efficient 
strategies  for  rewriting  on  dags.  By  dag  representation,  we  can  even  obtain 
efficient  rewriting  strategies  for  non-left -linear  rewriting  systems. 
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1      Introduction 

In  any  application  of  term  rewriting;  systems  (TRSs  for  short),  romputinc;  normal 
form  of  terms  is  a  basic  procedure.  In  language  application,  the  semantics  of  a  term 
is  its  normal  form  [DerSo].  In  solving  word  problem  [KB70].  to  test  equivalence  of 
two  terms  i  and  ,•'.  the  normal  forms  of  /  and  .-<  are  c()mi)uted.  To  unify  terms  in 
an  equation  theory,  the  equations  are  transformed  to  a  terminating  and  confluent 
TRS  and  the  normal  forms  of  the  terms  are  computed  [HuSO].  To  combine  logic 
and  functional  progrannning  languages.  TRSs  are  attached  to  Horn  clauses  and  in 
unification  of  clauses,  normal  form  are  comjnited  [SYSG][GMSG].  Therefore,  (efficiency 
of  normal  form  computation  is  important  in  all  applications  of  TRSs. 

The  research  for  efficient  normal  form  computati(.)n  is  done  in  many  ways.  Huet 
and  Levy  [HL79]  have  given  a  theoretic  analysis  for  derivations  and  proposed  '"strong 
sequential"  TRSs  which  can  be  transformed  to  an  automata  computing  normal  form 
fast.  In  [CKSS7].  Choppy  et  al.  have  given  technicjues  for  estimating  the  expecttxl 
number  of  steps  in  the  derivations  with  the  goal  of  constructing  efficient  '■reenlar'" 
TRSs.  In  [Li90].  XP-completeness  results  are  obtained  for  efficient  normal  form 
computation  and  several  optimal  strategies  are  proposed  for  subclasses  of  TRSs. 

Conventionally,  also  in  all  discussions  above  for  efficient  normal  form  con:iputation. 
terms  are  assumed  to  be  represented  by  trees.  But  tree  representation  is  inefficient 
in  space.  If  we  allow  common  subterms  to  be  shared,  a  term  can  be  rej^resented  by  a 
directed  acyclic  graph  (dag  for  short).  It  is  well-known  that  there  is  some  term  that 
can  be  represented  by  a  dag  with  ?;  nodes,  but  it  has  to  be  represented  by  a  tree 
with  exponential  number  of  nodes.  Based  on  dag  representation  of  terms.  Paterson 
and  Wegman  [PWTS]  have  designed  an  efficient  unification  algorithm.  In  rev(-rse.  the 
cause  of  unefficiency  of  Robinson's  vmification  algorithm  [RoliTl]  is  due  to  the  tree 
representation.  Besides  the  advantage  of  space-saving,  dag  representation  of  terms 
can  save  computing  time,  which  is  rewriting  steps  in  term  rewriting.  For  example, 
given  a  TRS: 

./■i(.r)  -  /(/,(.r)./,(.r)) 
M-r)  -  /(/3(x)./3(.r)) 

/,_,(.r)  -^  /./„(-r)./„(.r)) 

and  given  a  term  f\{(i).  in  tree  representation,  each  rewriting  will  introduce  two 
subterms  that  are  actually  the  same.  Eventtially.  we  need  exponential  of  ;/  rewritings 
to  rewrite  f]{a)  to  its  normal  form.  But  if  terms  are  represented  by  dags,  only  linear 
number  of  rewritings  are  needed. 

Another  good  exam])le  is  compiUation  of  Fibonacci  numbers,  which  is  shown  in 
[HPSSa]. 


Barendregt  et  al.  [BEG87]  have  proposed  "term  graph  rewriting",  and  Hoffman 
and  Plump  [HPSSa]  have  proposed  "jungle  evaluation".  In  the  both  research,  terms 
are  represented  by  mechanisms  like  dags,  the  goal  being  that  common  sul;terms  arc 
shared  and  normal  form  computation  will  be  more  efficient.  But  in  the  both  repre- 
sentations for  terms,  some  common  subterms  may  not  be  shared.  In  the  rewririne;. 
to  maintain  the  structure  of  dag  representation,  extra  work  is  needed.  The  both 
papers  have  not  analyzed  the  trade-off  between  the  extra  work  and  the  efficiency  of 
rewriting. 

In  this  paper,  we  propose  a  dag  representation  for  terms,  which  shares  the  most 
number  of  common  subterms.  One  of  the  advantages  of  this  representation  is  that 
each  term  is  mapped  to  a  unique  dag.  This  uniciueness  will  provides  us  a  clear  nn- 
derstanding  for  the  relationship  between  the  rewriting  on  trees  and  the  rewriting  on 
dags.  Properties  of  term  rewriting,  such  as  termination,  confluence,  non-ambiguity. 
etc..  will  be  compared  for  two  representations.  We  will  see  that  given  a  canonical 
(terminating  and  confluent)  TRS  represented  by  trees,  the  corresponding  TRS  repre- 
sented by  dags  is  also  canonical.  Furthermore,  we  will  show  that  the  unique  normal 
form  of  a  term  for  a  canonical  TRS  can  be  efficiently  comi')uted  by  the  corres])onding 
TRS  in  dag  representation. 

For  rewriting  on  dag  representation,  we  will  give  detailed  analysis  on  how  to  find 
a  redex.  how  to  rewrite  it.  and  how  to  maintain  the  dag  structure. 

Another  interesting  point  in  this  paper  is  to  give  optimal  or  "good""  strategies  fin- 
rewriting  on  dags.  This  has  not  been  done  for  the  existing  systems  for  rewriting  on 
dags  [BEGS7] [HPSSa].  We  will  see  that  the  strategies  for  efficient  rewriting  on  trees, 
which  are  proposed  in  [Li90].  can  be  adapti'd  for  efficient  rewriting  on  dags.  With 
dag  representation,  the  applications  of  these  strategies  are  expanded  to  much  larger 
subclasses  of  TRSs. 

The  most  of  previous  researches  on  efficient  rewriting  have  been  done  only  for 
left-linear  TRSs.  In  this  paper,  with  dag  representation,  we  obtain  several  results  on 
non-left-linear  TRSs. 

In  the  following,  we  will  introduce  c:)Ui  dag  representation  for  terms  and  TRSs 
in  section  2.  discuss  relationship  between  rewriting  on  trees  and  rewriting  on  dags 
in  section  3.  and  address  efficient  rewriting  strategies  for  our  dag  rejiresentation  of 
TRSs  in  section  4.   Section  5  concludes  the  pajier  and  suggests  fut\u-e  work. 


2      Dag  Representation  of  Terms  and  Dag  Rewrit- 


ine: 


Recursively,  a  term  is  defined  to  be  a  constant  or  a  variable  or  / ( /] t,_ )  where  /  is  a 

function  of  arity  77  and  /i /„  are  terms.  In  convention,  a  term  is  dejjicted  as  a  tree: 

constants  and  variables  are  the  bottom  nodes  and  each  function  corresponds  to  an  dis- 
tinct inner  node.  For  e\-amj)le.  term  f((i(li((i.a).  h((i.  a)),  gihia.  a  ).  Ii{a.  a)).  }i{a.  (i)) 
is  a  tree  shown  in  figure  1(a)  If  we  allow  subterms  to  l^e  shared,  a  term  can  be 
represented  by  a  directed  acyclic  graph  (.sliort  as  dag).  For  exam]ile.  the  term  in 
figure  1(a)  is  represented  by  a  dag  in  figure  1(M  in  which  an  arc  represents  a  directcvl 
up- down  edge. 


The  figures  apparently  show  that  the  dag  representation  of  a  term  save  space 
than  the  tree  representation.  Besides  advantage  of  space-saving,  dag  representation 
of  terms  can  save  a  lot  of  computing  time  —  rewriting  steps  in  words  of  rewriting 
systems.  In  the  early  stage,  dag  representation  is  used  to  efficiently  conipute  recur- 
sive programs  [PIvIT74][BL79].  Recently,  dag  representation  is  proposed  to  generate 
efficient  reductions  for  general  rewriting  systems  [BEGS7][HKPSS][HPSSa].  Due  to 
the  advantage  of  saving  space  and  time,  dag  representation  is  preferred  in  many 
applications  of  rewriting  systems. 

Our  topic  is  how  to  take  the  greatest  advantage  of  time-saving  by  dag  representa- 
tion in  term  rewriting.  For  a  term,  there  may  exist  several  corresponding  dags.  See 
figure  2.   The  term  in  figure  2(o)  can  be  represented  by  each  of  the  5  dags  in  figure 

2  from  ib)  to  (/).   The  dag  in  figure  2(/)  shares  the  most  number  of  subterms. 

The  reason  why  dag  representation  saves  reduction  time  for  rewriting  systems  is 
that  many  rewritings  are  performed  on  the  shared  subterms  so  that  many  duplicated 
rewritings  are  avoided.  For  example,  given  rewriting  system  {a  — »  /*}  and  term 
''  =  f{9(o-o)-  g{ci-  o)),  4  rewritings  have  to  be  done  to  i.  If  /  is  represented  by  the 
dags  in  figure  2,  3  rewritings  are  needed  for  (6)  and  (c),  2  needed  for  (d)  and  (f ).  1 
needed  for  (  f).    It  is  easv  to  conclude  that  the  more  subterms  are  shared,  the  less 


} 

V 

a 

(f) 

Figure  2 
number  of  rewritings  are  needed,  that  is.  the  more  reduction  time  is  saved. 

The  existing  dag  representation  for  rewriting  systems  (expressed  by  '•  graph  rep- 
resentation" in  [BEGS7]  and  "jungle"  in  [HPSS])  does  not  specificly  use  the  dag  which 
shares  the  most  number  of  subterms  and  it  permits  some  common  subterms  not  to 
be  shared.  As  an  example,  one  of  the  5  dags  in  figure  2  is  allowed  to  represent  the 
term  in  figure  2(a).  If  dag  representation  is  used  for  rewriting  systems,  the  extra 
work  has  to  be  done  to  maintain  the  dag  structure.  The  maintaining  work  is  called 
"garbage  collection"  in  [BEGS7]  and  "folding"  in  [HPSS].  We  immediately  think 
of  the  trade-off  between  the  extra  maintaining  work  and  the  saved  rewriting  steps 
for  dag  representation.  From  the  conclusion  mentioned  above,  we  i^refer  the  dag 
representation  that  shares  more  subterms  and  does  not  need  mvich  maintaining  tinie. 

Here,  we  propose  a  dag  representation  for  rewriting  systems  that  shares  the  most 
of  subterms  and  the  maintaining  work  is  not  much.  Furthermore,  we  will  propose 
strategies  for  this  dag  representation  to  speed  up  the  rewriting  process. 

First,  rewriting  on  terms  should  be  understood  clearly.  Let  /  be  a  term  and  /  — ►  r 
a  rule.  We  say  /  — >  r  can  be  applied  to  /  if  there  is  a  substitution  rr  and  a  sul)torm 
i'  in  /  such  that  la  =  t' .  This  process  is  called  a  matching,  i.e..  /  matches  i'.  Term  / 
with  a  subterm  t'  is  denoted  by  t[t'].  A  rewriting  of  /  — +  r  to  /  is  a  rei)lacement  of  /' 
by  ra  in  t.  denoted  by  t[t']  —>  t[rcr].  The  reflexive  and  transitive  closure  of  rewriting 
—*    is  denoted  by    —^  . 

In  our  dag  representation,  every  term  is  represented  by  a  dag  that  shares  all 
common  subterms.  As  an  example,  for  the  term  in  figure  2(r/).  our  corresjioiidiug 
dag  is  the  dag  in  figure  2(/).  Formally,  a  term  /  is  mapped  to  a  dag  t,iag  in  which 
there  is  only  one  occurrence  for  each  constant  and  variable,  and  if  we  have  two  terms 

p  =  /,(!', /;,. )  and  q  =  /j(.si >k)  where  /i  =  /2.  and  /,  and  -s  (1  <  /  <  A')  share 

one  node  in  the  dag.  then  /)  and  q  share  one  node  in  the  dag.  We  have 

Proposition   1    Any  tervi  i,<  mapped  to  a  unique  dag. 

Proof:  Given  a  term  i.  the  corresponding  dag  representation  idag  can  I'c  constrvicttnl 
from  bottom  to  up.  The  unicjueness  can  be  easily  shown  by  induction  on  the  lev<"l  of 
the  nodes  from  bottom  to  up.  CI 


Let  /?  be  a  term  rewriting  system  consisting  of  m  rules,  i.e..  i?  =  {/,—>  r,  |1  <;  < 
111}.  I,  and  r,  are  terms.  In  our  system,  terms  in  rules  are  also  represented  by  dags. 
Specifically.  /,  and  7-,  in  R  are  dags.  Now.  we  consider  how  rewriting  is  performed  for 
our  dag  representation. 

Each  rewriting  consists  of  4  steps: 

Step  1.  Redex  finding; 

Step  2.  Dag  splitting: 

Step  3.  Rewriting: 

Step  4.  Dag  merging. 
We  will  describe  each  step  in  tlie  detail.     After  that,  for  clear  understanding,  an 
example  will  be  given. 

Suppose  /  ^  r  is  applied  to  /. 

In  step  1.  redex  finding  is  processed  ahnost  conventionally.  But  since  terms  are 
represented  by  dags,  the  matching  of  a  term  by  a  rule  can  be  done  by  Paterson  and 
Wegman's  efficient  unification  algorithm  [PW7S]  (or  any  other  efficient  unification 
algorithm  based  on  dags),  because  matching  is  a  special  case  of  unification,  that  is. 
the  variables  in  the  term  being  matched  are  considered  as  constants.  Therefore,  redex 
finding  is  much  efficient  than  the  conventional  one  in  which  terms  are  represented  by 
trees. 

In  step  2  (dag  splitting),  make  a  copy  of  the  redex  obtained  in  step  1  and  delete 
the  nodes  in  f  which  are  in  the  redex  but  not  shared  by  any  nodes  besides  the  redex. 
We  will  let  the  root  of  the  redex  stay  and  call  it  dangling  root.  This  can  be  done  in 
linear  time,  for  example,  by  width  first  algorithm.  Sui)pose  after  this  step,  i  becomes 

In  Step  3.  rewriting  is  done  to  the  copy  obtained  in  step  2.    This  is  the  same  as 
the  conventional  one.  but  the  result  is  a  dag.  Denote  the  result  by  /re- 
in step  4.  merge  ire  vvith  tmed-     In  the  following  we  will  give  a  linear  merging 
algorithm  for  two  dags.    After  merging,  make  the  dangling  root  ol:)tained  in  step  2 
point  to  the  root  of  ire  See  the  following  example  for  clear  understanding. 

Example.  Given  rewriting  system  {g{x.y)  — >  h(x)}  and  term  f({i{a.b).g(b.c)).  The 
four  steps  to  rewrite  the  term  are  shown  in  figure  3. 

Here  is  the  algorithm  for  dag  merging. 

Dag  Merging  Algorithm. 

Input:  dag  irned  and  dag  tre- 
Output:  one  dag. 
Process: 


step  1 

step  2 

Step  3 

Step  4 

f 

f 

rewriting 

f 

/  X^redex 

xaS 

A.' 

copy 

,\A.^ 

Ij     c          b 

A) 

^ 

dans 

ling  root         ^' 

merge  /med  and  / 

Imed 

tre 

Figure  3 

a)  Merge  the  bottom  nodes  of  t„ied  and  tre-  that  means. 
if  there  are  two  common  bottom  nodes  in  /„,ej  and  ir^. 
merge  the  duphcated  node  in  /rt  to  the  one  in  tmed- 

b)  In  tre-  from  bottom  to  up.  for  each  inner  node  c  do: 

if  all  children  of  c  have  been  merged  into  tmed  and  the 
children  have  a  fatlier  c  in  t^^d.  merge  c  in  tre  to  c  in 
tmed-  otherwise,  keep  c  in  tre- 
Proposition  2    The  dag  merging  algorithm  generates  a  dag  and  iJie  rv.nnivg  tunc  is 
linear  to  \tmed\  +  Itrel-   Actually,  in  the  most  of  cases,  the  time  is  almost  linear  to  |^,-f|. 

Proof:  At  step  a)  in  the  algorithm,  all  the  bottom  nodes  are  merged  so  that  there 
is  only  a  single  node  for  a  variable  or  a  constant.  By  induction  on  the  levels  of  the 
nodes  in  tre  from  bottom  to  up.  it  is  easy  to  verify  that  there  is  no  redundant  copys 
of  any  subterm.  That  is.  a  dag  is  produced. 

Merging  bottom  nodes  of  tmed  and  tre  can  be  done  in  time  linear  to  the  nimiber 
of  the  bottom  nodes.  For  each  inner  node  in  tre-  ^'^'^  check  its  children  to  see  if  it 
can  be  merged  into  t^ed-  Notice  that  the  arities  of  ftmction  symbols  are  fixed,  so  the 
time  for  step  b)  is  0{\tre\)-  The  total  time  is  0{\tmed\  +  |Ae|)-  Actually,  for  /,„,rf.  we 
only  consider  the  bottom  nodes  of  imei-  which  are  constants  and  variables.  Generally, 
in  a  term,  most  of  symbols  are  function  symbols  while  there  are  only  a  few  distinct 
constants  and  variables.  Therefore,  in  most  of  cases,  the  algorithm  is  almost  linear 
to  |^,|.  □ 

If  terms  are  represented  b}-  dags  and  rewritings  on  dags  are  jierformed  ])y  the 
method  described  above,  we  call  this  rewriting  a  dag  rewriting.  The  conventional 
rewriting  on  trees  is  called  a  tree  rewriting.  In  the  same  way.  we  call  a  TRS  rejire- 
scnted  by  trees  a  tree  TRS  and  a  TRS  represented  by  dags  a  dag  TRS.  Henceforth. 
we  assume  that  dag  TRSs  are  api)lied  only  to  terms  in  dag  rejiresentation  and  tree 
TRSs  are  applied  only  to  terms  in  tree  representation. 

Now.  let's  analyze  efficiency  of  dag  rewriting. 

To  rewrite  a  term  in  dag  rewriting,  ^r-^^p  1  (described  above)  saves  time  whiU^  step 
2  and  step  4  are  extra  work.    Both  step  2  and  step  4  run  in  time  almost  linear  to 


the  size  of  the  redex  found  in  step  1.  Step  1  saves  time  in  two  hands:  (1)  Because  of 
dag  representation,  hnear  matching  algorithms  (or  any  efficient  algorithm  on  dags) 
can  be  used.  (2)  To  find  a  redex  from  term  f.  we  have  to  try  matching  algorithm  at 
several  positions  in  t.  Dag  representation  rechices  the  search  sjjace.  that  means,  h^ss 
matching  would  be  tried.  We  already  know  that  matching  algorithm  is  at  ijest  linear 
to  the  sizes  of  the  two  terms  being  matched.  Therefore,  we  can  conclude  that  the 
extra  work  in  step  2  and  in  step  4  is  balanced  by  the  time-saving  in  step  1. 


3      Characteristics  of  Tree  Rewriting  and  Dag  Rewrit- 
ing 

Traditionally,  many  properties  of  term  rewriting  systems  are  discussed  on  tree  rep- 
resentation, such  as  termination,  confluence,  ambiguity,  etc..  Here,  we  will  consider 
these  properties  on  dag  rewriting  systems.  These  properties  are  discussed  in  [HPSSb] 
for  "jungle  evaluation",  but  our  system  is  simpler  and  our  discussion  is  much  simpler. 
In  this  section,  rewriting  system  R  and  term  t  without  any  subscripts  will  be  assumed 
to  be  represented  by  trees  and  their  corresponding  dag  representations  are  denoted 
by  Rdag  and  t^iag  respectively. 

Rewriting  Derivations 

Intuitively,  given  a  tree  TRS  R  and  the  corresponding  dag  TRS  Rdag-  a  rewriting 
derivation  Ddag  using  Rdag  is  a  shorter  version  of  D  using  R.  Formally,  we  have 

Proposition  3  Given  R  and  the  corresponding  Rdag-  ^f'i  Ddag  be  a  derivation  using 
Rdag  which  rewrites  term  t  to  term,  t' .    Then,  there  is  a  derivation  D  using  R  such 

that  D  rewrites  t  to  t'  and  \D\  >  \Ddag\-  •.  ■  ■  'i  '  ■  ' 

Proof:  Consider  one  rewriting  step  Ddag[i]  in  Ddag  which  rewrites  Sdag  to  s'^^^  by  rule 
hag  —>  '''dag  hi  Rdag-  In  Ddag{i]-  oue  of  rcclcxes  of  ^dag  IS  rewritten,  say  it  is  ii .  There 
must  be  many  copies  of  u  in  .^  all  of  which  can  l)e  rewritten  by  the  ruh^  /  —  r  in  R. 
We  arrange  these  rewritings  by  /  — ♦  r  in  a  sequence,  oljtaining  a  derivation  usint;,  7? 
which  rewrites  .^  to  s' . 

We  expand  each  rewriting  step  in  Ddag  in  this  way.  obtaining  a  rewriting  derivation 
D  using  R  which  rewrites  t  to  /'.  Because  each  rewriting  step  in  Ddag  is  expanded  to 
at  least  one  rewriting  step  in  D.  we  have  \D\  >  jD^a^l.  □ 


Termination  and  Confluence 

We  say  a  TRS  R  is  terminating  if  for  any  term  /.  there  is  no  infinite  rewriting  chain 
using  R  for  t. 

Proposition  A   If  R  i»  terminating,  then  Rdag  z-'*  terminating. 

Proof:  If  there  is  an  infinite  sequence  of  rewritings  using  Rjag-  hy  proposition  3.  we 
can  construct  an  infinite  secjuence  of  rewritings  using  R.  contrary  to  tlie  conchtion 
that  7?  is  terminating.  Therefore.  R^ag  is  terminating.  D 

A  TRS  R  is  said  confl.uent  if  for  any  t.  two  rewriting  clcri\"ations  /  — «  .sj  and  /  —  >_. 

imply  that  there  exists  a  term  t'  such  that  there  are  two  rewriting  derivations  .Si  -^  t' 

and  $2  — ♦  ^'• 

Unfortunately,  the  condition  that  7?  is  confluent  does  not  guarantee  that  R.^.^g 
is  confluent.  One  of  courter-examples  is  given  by  [HPSSb]:  R  —  {/(.r)  — >  g(.r..r). 
f(a)  — >  .V.  g(a.a')  — >  A',  a  — >  a',  a'  — »  a].  R  is  confluent  for  tree  representation.  For 
dag  representation,  term  f(a)  can  be  rewritten  to  A'  and  g(a.a).  but  the  latter  can 
only  be  rewritten  to  g(a'.a').  that  means  g(a.a)  cannot  be  rewritten  to  .V  liy  rule 
gia.a')  -^  X. 

In  this  paper,  our  major  concern  is  that  given  a  terminating  and  confluent  TRS. 
how  to  compute  normal  forms  for  terms  fast.  Fortunately,  if  7?  is  hofli  terminating 
and  confluent,  we  can  guarantee  that  Rdag  is  terminating  and  confluent.  To  j^rove 
this,  we  need  the  proposition  below: 

Proposition  5   t  is  reducible  using  R  if  and  only  iftjag  ^-^  reducible  u.^ing  Rd^g- 

Proof:  First,  because  matching  (special  case  of  uniflcation)  between  dags  ol:)tains 
the  same  substitution  for  variables  as  matching  between  the  corresponding  trees  (see 
[PW7S]),  we  have  the  claim  that  t  can  be  matched  by  .s  if  and  only  if  tjag  can  be 
matched  by  s^ag- 

Therefore,  t  can  be  rewritten  by  a  rule  r  from  7?  if  and  only  if  /j,,.,  can  be  rewritten 
by  the  corresponding  rule  of  r  from  Rdag-  n 

Proposition  G  If  R  !.>••  terminating  and  confluent,  then  Rdag  z-'  trrminafinq  and  con- 
fluent. 

Proof:  Termination  of  Rdag  is  proved  above.  \\'e  prove  confluence  of  Rdag  '^^  follows. 
Given  any  term  tjag-  suppose  there  are  two  rewriting  derivations  tdag  — *  ^dag  f<iid 
tdag  — +  ^'dag-  Since  Rdag  terminating,  we  rewrite  Sdag  and  ,s^^^  to  their  normal  forms 
Udag  and  ii'^^g  respectively.  If  Udag  =  n'dag-  '^^'^  are  done.  Sui)posc  Udag  ^  y'dag-  ^y 
proposition  3,  there  are  two  tree  rewriting  derivations  using  7?  that  rewrite  /  to  u 
and  u'  respectively.    By  proposition  1,  Udag  ^  li'dag  imphes  that  u  ^  u'.    Since  7?  is 


confluent,  one  of  ii  and  u'  must  be  reducible  using  R.  By  proposition  5.  one  of  ujag 
and  i/^„  must  be  reducible  using  Rdag-  but  this  is  contrary  to  the  assumption  that 
both  Udag  and  it'j^g  are  normal  forms  for  Rdag-  Therefore,  we  must  have  ridag  =  n'dag- 
Confluence  is  proved.  ^ 


Non-ambiguity 

We  say  a  TRS  is  non- ambiguous  if  there  are  no  two  rules  /j  — ♦  r,  and  /2  — >  '"2  in  R 
such  that  a  non-variable  subterm  of  /j  can  be  unified  with  I2.  excei)t  that  /j  -+  7-1  and 
/o  -^  '"2  are  the  same  rule  and  the  non-variable  subterm  is  /i  itsrlf. 

Because  unification  between  dags  obtains  the  same  suljstitution  for  variables  as 
unification  between  corresponding  trees  (see  [PW7S]).  for  R  and  corresponding  Rdag- 
if  there  are  two  rules  Idag  -^  rdag  in  Rdag  and  /^^^  -►  r^^^  such  that  a  non-variable 
subterm  of  Idag  can  be  unified  with  I'^^^.  then  we  have  two  rules  in  /  ^  r  and  /'  -^  r' 
in  R  such  that  a  non-variable  subterm  of  /  can  be  imified  with  /'.  Therefore,  we  have 
Proposition  7  If  R  la  non- ambiguous,  then  Rdag  is  non-ambiguous. 

Normal  Form  Computation 

We  are  concerned  about  how  to  eflRciently  compute  normal  forms  for  terms  gi\-en 
terminating  and  confluent  TRSs.  For  a  terminating  and  confluent  TRS.  any  term 
has  a  unique  normal  form.  We  have  already  shown  that  if  R  is  terminating  and 
confluent,  then  Rdag  is  terminating  and  confluent.  Naturally,  we  would  ask  if  any  f 
has  the  same  normal  form  using  R  as  tdag  has  using  Rdag- 

Proposition  8  If  R  is  te.rm,ina.ting  and  confluent,  the  normal  form  of  any  term  f 
using  R  is  the  tree  representation  of  the  normal  form,  of  tdag  using  Rdag- 

Proof:  By  proposition  6.  Rdag  is  terminating  and  confluent,  tdag  has  a  unique  normal 
form  t'^^g.  that  means,  there  is  a  dag  rewriting  derivation  Ddag  using  Rdag  that  rewrites 
idag  to  t'^^g.  By  proposition  3.  there  is  a  tree  rewriting  derivation  D  using  R  that 
rewrites  t  to  /'.  By  proposition  5.  /'  is  irreducible.  Since  /  has  only  one  normal  form. 
t'  is  the  unique  normal  form  of  /.  n 

Intuitively,  for  a  terminating  and  confluent  TRS.  we  can  use  the  corresponding  dag 
TRS  to  compute  the  unique  normal  form  for  any  term  more  efficiently,  since  several 
rewriting  steps  in  tree  rewriting  are  replaced  by  one  dag  rewriting  step.  Stai)le"s 
speed-up  theorem  in  [StaSOb]  implies  the  following  proposition: 

Proposition  9  Let  R  be  a  term.inat.ing  and  non -ambiguous  TRS.  i  be  a.ny  term.  \f\ 
the  length  of  the  shortest  derivation  to  compute  t 's  normal  form,  and  \tdag\  ibe  length 
of  the  shortest  derivation  to  compute  tdag'^"  normal  form.    Then.  \tdag\  <  |'|- 


Non-left-linear 

The  '"graph  rewriting"  [BEG87]  and  "jungle  evaluation'"  [HPSSa]  have  some  trouble 
when  they  are  used  for  non-lcft-linear  rewriting  systems.  Our  systems  can  be  eas- 
ily used  for  non-lcft-linear  systems.  The  matching  algorithm  has  no  problem  when 
dealing  with  non-linear  terms  represented  by  dags  [PW7S].  In  dag  representation, 
the  same  variables  share  the  same  node.  Therefore,  the  dag  rewriting  for  a  non-lcft- 
linecir  rule  is  no  different  from  the  dag  rewriting  for  a  left-linear.  To  make  clear,  we 
give  an  example  for  a  non-left-linear  system.  Meanwhile,  to  show  how  dag  rewriting 
saves  rewriting  steps,  we  draw  out  for  tree  rewriting  and  dag  rewriting  all  jiossiljle 
derivations  that  rewrite  a  given  term  to  its  normal  form.  See  figure  4.  Wc  simply  see 
that  by  dag  rewriting,  we  ha\'e  not  only  shorter  derivations  but  also  a  much  smaller 
term  space  (that  are  all  terms  in  all  derivations).  This  becomes  important  when  we 
design  strategies  for  optimization  of  rewriting.  In  the  following  section,  we  will  give 
several  efficient  strategies. 

Example.  System={/(x,  x)  —^  g{h{x).  x).  g{x.  y)  — >  A'(x.  y).a  — >  c.  h  — +  d]  and  term 
fia.a). 


f(a.a 


f(a,a; 


f(c,a)      g(h(a).a)  f(c.c7g(h(^c),a)     gXhla).c)^  X(h(< 

g(li(c^)     N{h(a).a) 
X(h(c).c) 

dag  rewriting 

Figure  4 


4      Optimization  of  Dag  Rewriting 

Dag  representation  of  rewriting  systems  can  speed  up  the  rewriting  process,  as  de- 
scribed in  the  previous  sections.  In  this  section,  we  will  consider  how  to  speed  up 
dag  rewriting  and  propose  several  strategies  for  this  purpose. 

Optimization  of  rewriting  has  been  discussed  on  tree  representation  [HL79].  Sonu- 
researches  [BL79][PMT74]  have  been  done  for  dag  rewriting,  but  they  addressed  only 
recursive  programs,  other  than  term  rewriting  systems  that  are  our  target. 

Let  f  be  a  term  and  x  a  variable.  N urn\'ar(x  J)  denotes  the  number  of  occur- 
rences of  J  mi.  We  define: 
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Variable -fewer   TRS:  For  any  variable  x  and  any  rule  /  — >  r.  .\  inv^'ari.r.  I)   > 

Num\'ar(x.  r). 

Variable -mort;    TBS:  For  any  variable  x  and  any  rule  /   — »   r.   X imi\'ar{x.l)    < 

.\um\'ar(x.  r). 

Variable -equal   TRS:  For  any  varialtle  x  and  any  rule  /  — *  r.   X  imi\'or{x.l }   = 

X  tnt)\'ar{x.r). 

If  a  redex  is  not  a  proper  subterm  of  any  redex.  it  is  called  an  outermost  redcx. 
If  a  redex  does  not  contain  any  proper  subterm  as  a  redex.  it  is  called  an  ivvrrmnst 
redex.  Let  f  be  a  term  and  tsuh  a  proper  subterm  of  /.  The  path  from  the  root  of  /  to 
the  root  of  t^uh  (excluding  the  root  of  tsub)  is  called  the  root  path  of  /.,„,,  in  /.  denoted 
by  root-pai.h(tsub-'t)-  E-g--  f'^r  term  /  =  f{g{h{a.f{n.h))).c).  root-path(f(a.I>).f)  is 
f-g-h,  and  rooi-path(c.t)  is  /.  Let  /  be  a  term.  ts,,h  a  proper  sul)term  in  i.  and  /,„h 
also  a  redex.  If  there  is  a  rule  /  -^  r  and  a  vari;d)lr  .?■  in  /  such  that  root-path{  x.l) 
matches  a  substring  of  root-path(ts^h.t).  ■vve  call  t^ub  a  variable  redcx  (of  rule  /  ->  r)  in 
t.  E.g..  TRS  =  {f(g{x).a)  -^  h(x.(}).  b  -^  c}.  in  term  t  =  f(g(g(a.f(a.b))).c).  h  is  a 
redex  because  of  the  second  rule,  root-path(x.  f(g(x).  a))  \i^  f-g  where  f{g{x).  a]  is  the 
left  side  of  the  first  rule,  and  root-pat.h(b.i)  is  f-g-f.  so  f-g  is  a  substring  of /-^-Z  and  h 
is  a  variable  redex  in  t.  Using  these  definitions,  we  define  the  following  strategies: 

Outermost  strategy.   To  rewrite  a  term,  choose  an  outermost  redcx. 

Variable- delay  strategy.  To  rewrite  a  term,  if  there  are  non-variable-redexes. 
choose  a  non-variable-redex  to  rewrite;  otherwise,  all  of  redexes  are  variable  redexes 
and  we  choose  a  variable  redex  to  rewrite. 

Innermost  strategy.  To  rewrite  a  term,  choose  an  innermost  redex. 

In  the  previous  sections,  a  derivation  is  a  sequence  of  rewriting  steps.  To  simi)lify 
terminology,  from  now  on.  a  derivation  is  defined  as  a  secj\ience  of  rewriting  steps  to 
rewrite  a  term  to  its  normal  form.  Any  two  derivations  that  rewrite  the  same  term 
to  its  same  normal  form  are  said  equivalent.  An  optimal  derivation  is  the  shortest 
one  among  all  equivalent  derivations.  The  ;th  rewriting  in  derivation  D  is  denoted 
by  D[i].  D{i.j]  (i  <  j)  denotes  all  rewritings  from  D[i]  through  D[j].  A  derivntion 
in  which  at  each  step  the  outermost  strategy  is  used  is  called  a  outermost  derivation. 
The  variable-delay  derivation  and  innermost  derivation  are  defined  in  the  same  way. 

The  paper  [LiOO]  has  studied  the  optimization  of  tree  rewriting  for  terminating, 
left-linear  and  non-ambiguous  TRSs.  The  major  results  are: 
(1  )  For  variable-equal  TRS.  any  deri\-cition  is  optimal: 

(2)  For  variable-more  TRS.  an  innermost  derivation  is  optimal: 

(3)  For  variable-fewer  TRS.  any  derivation  D  has  an  ecjuivalent  outermost  deriva- 
tion D'  such  that  \D'\  <  \D\: 

(4  )  For  variable-fewer  TRS.  any  outermost  derivation  D  has  an  equivalent  variable- 
delav  derivation  D'  such  that  ID' I  <  1 1?,. 
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In  this  paper,  we  will  consider  validity  of  these  strategies  on  dag  rewriting  and  also 
extend  these  results  to  non-left-linear  TRSs.  Our  discussion  is  based  on  the  result 
obtained  in  the  last  section  that  if  a  tree  TRS  is  terminating  and  non-ambiguous, 
then  the  corresponding  dag  TRS  is  terminating  and  non- ambiguous.  We  will  discuss 
left-linear  TRSs  first,  and  then  consider  non-left-linear  cases.  We  assume,  if  without 
any  other  comments,  that  any  TRS  to  be  used  is  terminating  and  non-ambiguous, 
and  that  any  TRS  and  any  term  are  in  dag  representation. 

.A.  rule  r  is  said  root-rewrites  to  a  term  t  if  the  redex  is  /  itself.  Let  rule  r  rewrite 
term  /.  If  r  root-rewrites  t.  we  call  r  is  applied  on  /:  if  r  root-rewrites  a  proper 
subterm  of  t.  we  call  r  is  applied  inside  t:  if  we  don't  care  whether  this  rewriting 
occurs  on  or  inside  t.  we  say  7-  is  applied  to  t .  Let  .s  be  a  proper  subterm  of  /.  If  r 
root-rewrites  a  subterm  of  /  which  is  a  proper  superterm  of  ,'^.  we  say  the  rewritinc; 
occurs  above  s  in  /.  Let  o?  be  a  proper  subterm  of  term  t.  .A.ny  proper  subterm  of  / 
whose  root  is  not  on  the  root  path  of  .s  in  t  is  called  a  general  sibling  of  .^. 

4.1      Variable-More  TRSs 

In  tree  rewriting,  for  left-linear  and  variable-more  TRSs.  any  innermost  derivation 
is  optimal.  But  in  dag  rewriting,  for  the  same  subclass  of  TRSs.  any  derivation  is 
optimal  (no  condition  '"innermost").  That  means,  for  this  subclass  of  TRSs.  we  don't 
need  to  worry  about  the  order  of  rewritings  on  redexes.  and  we  can  use  any  derivation 
to  get  the  normal  form  with  the  shortest  length. 

Lemma  1  In  a  derivatioTi  constructed  by  a  left-linear  and  variable-more  TRS.  any 
redex  does  not  disappear  uriless  it  is  rewritten. 

Proof:  Suppose  Z)  is  a  derivation  constructed  by  a  left-linear  and  varialjle-more  TRS 
i?.  .s  is  a  redex  after  D[i].  and  D[i  +  1]  rewrites  t{:^  s). 

Case  1.  /  is  inside  .s.  t  must  be  a  variable  redex  of  .s.  Suppose  .?  is  a  redex  of  rule  r. 
Since  R  is  left-linear,  after  rewriting  of  t.  s  remains  a  redex  of  rule  r. 

Case  2.  /  is  above  5.  .s  must  be  a  variable  redex  of  t.  Since  R  is  variable-more,  o* 
remains  a  redex  after  rewriting  of  t. 

Case  3.   t  is  a  general  sibling  of  .s.   Obviously  .s  remains  a  redex  after  rewriting  of  /. 

n 

Theorem  1  In  dag  rewriting,  for  a  lef.-linear  and  variable-more  TBS.  given  any 
term,  any  derivation  for  that  term  is  oplimal. 

Proof:  Suppose  D  and  D'  are  two  derivations  for  term  t  and  \D'\  <  \D\.  Suppose 
D[l.i]  =  D'[l.i]  and  D[i  +  l]  #  D'[,  +  l].  D[,  +  I]  rewrites.^  and  D'[i  +  l]  rewrites  .s'. 
By  lemma  1.  .*;  remains  a  redex  until  it  is  rewritten  in  the  subsequent  derivation  of 
D'.  Suppose  i!  is  rewritten  in  D'  by  rule  r  at  step  D'[j]{i  <  j).  All  function  syml^ols 
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(and  constants,  if  there  are)  in  5  matched  with  function  symbols  in  the  left  side  of  r 
are  called  the  critical  part  of  ,s.  Note  that  those  subterms  in  .s  matched  with  variables 
of  r  are  not  in  the  critical  ])arr. 

Suppose  j  =  i  +  A"  [i-  j  defined  above).  Let  /,+,,  be  the  term  after  step  D'[i  +  p] 
in  D'  {0   <  p  <   k  —  1).     Root-rewrite  .<;  in  t,  by  rule  r.   t,   Ijecoming  /q.     Consider 

applying  D'[i  +  l].D'[i  +  2] D'[i  +  k  -  I]  to  t'^.  Let  t[,  be  the  term  after  rewriting 

step  D'[i  +p](0  <  p  <  k  -  1).  We  prove  by  induction  that  /,+,,  and  /|,  (0  <  p  <  k  -  1) 
are  almost  the  same  except  the  critical  part. 

Basis,  p  =  0.  Remember  that  /,+o  is  rewritten  to  /[,  by  applying  rule  r  to  the  root 
of  .s  in  ^,+0.  Because  r  is  variable-more  and  left-linear,  and  r  is  represented  i)y  a  dag. 
only  the  critical  part  is  clianged.  Hence,  the  claim  holds  for  the  Ijasis. 

Induction.  Suppose  the  claim  holds  for  p  —  1(0  <  p  <  A'  —  1).  That  means. 
t,+p-\  and  fp_j  are  almost  the  same  except  the  critical  part.  Suppose  .s'  in  /,+p_i  is 
root-rewritten  at  step  D'[i  -f  p].  We  have  three  cases. 

Case  1.  s'  is  inside  .s.  j;'  must  be  a  variable  redex  of  .s.  which  is  not  in  the  critical 
part,  that  means,  .s'  is  in  t'^_^.  Hence  D'\i+p]  can  be  applied  to  .^'  in  t'p_^.  Therefore 
tt+p  and  /'  are  almost  the  same  except  the  critical  part. 

Case  2.  0?'  is  above  .s.  $  must  be  a  variable  redex  of  i^'.  Suppose  5  is  rewritten  by 
rule  r  and  the  position  of  5'  in  i,+p_i  is  p.  Let  5"  be  the  subterm  in  t'^_^  at  position 
p.  s'  and  .s"  are  almost  the  same  except  the  critical  part  which  is  under  .s.  so  rule  r 
can  be  applied  on  ;;".  Since  .s  is  a  variable  redex  in  .s"  of  rule  r.  the  critical  part  is 
not  changed. 

Case  3.  0^'  is  a  general  sibling  of  .s.  Obviously.  D'[i  +  p]  can  be  applied  to  both  /,+p_] 
and  f'j  and  ^+p  and  t'  are  almost  the  same  except  the  critical  part. 

The  claim  is  proved.  We  have  known  that  t^+k-i  and  /;._j  are  almost  the  same 
except  the  critical  part.  Note  that  D'[j]  (i.e.  D'[i  +  k])  root-rewrites  s.  The  term 
obtained  by  this  rewriting  to  tj_\  (i.e.  t,+k-i)  is  exactly  the  same  as  /'^._j  because 
the  critical  part  is  changed.  The  significance  is  that  if  we  move  the  rewriting  on  .< 
at  D'lj]  to  just  in  front  of  D'[i  +  1].  all  the  rewritings  from  D'[i  +  1]  through  the 
end  of  D'  are  not  affected.  By  this  way.  we  construct  a  new  derivation  D"  such  that 
D"  is  equivalent  to  D' .  \D"\  =  \D'\.  and  D"[l.)  +  1]  =  D[\.i  +  1].  Keej^ing  on  this 
way.  we  can  eventually  construct  a  derivation  D'"  such  that  D'"  is  equivalent  to  D' . 
\D"'\  =  \D'\.  and  D'"  is  a  prefix  of  D.    Tiiis  is  impossible  since  D'"  is  equivalent   to 

D.  a 

4.2      Left-Linear  TRSs 

We  have  shown  that  for  a  left-linear  and  variable-more  TRS,  all  derivations  have  the 
same  length.   But  for  a  left-linear  and  variable-fewer  TRS.  for  a  term,  we  may  have 
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derivations  to  compute  its  normal  form  with  different  lengths.  The  reason  is  that 
some  variable  (say  x)  which  is  in  the  left  side  of  a  rule  does  not  occur  in  the  right  side 
of  the  rule,  so  that  when  the  rule  is  applied,  the  redcx  matched  by  .r  may  disappear 
without  rewriting  and  different  order  of  rewritings  have  different  effects.  For  example, 
given  system  {f(x.y)  — >  .V.  c;  — ►  b}  and  terni  f{a.h).  we  have  two  derivations  with 
different  lengths:  fia.b)  -+  A'  and  f{a,b)  -»  f{b.b)  —>  X . 

In  [Li90].  we  have  shown  that  for  left-linear  and  variable-fewer  any  derivation 
has  an  equivalent  outermost  derivation  which  has  no  more  rewriting  steps.  In  other 
words,  for  any  derivation,  we  may  find  an  equivalent  outermost  derivation  which  are 
more  efficient.  With  dag  representation,  we  can  lift  the  condition  '"variable-fewer" 
to  get  the  same  result,  that  means,  the  result  is  valid  for  any  left-linear  TRS  (don't 
forget  non-an:ibiguity  condition). 

Theorem  2  In  dag  rewriting,  for  a  leff.-linear  TRS.  any  derivaiiov  D  ha."  an  equiv- 
alent outermost  derivation  D'  such  that  \D'\  <  \D\. 

Proof:  Let  i?  be  a  linear  TRS.  D  a  derivation,  and  tail-length!  D)  the  length  of  the 
subderivation  in  D  fron:i  the  first  non-outermost  rewriting  through  the  end. 

We  use  induction  on  tail-length(  D )  to  prove: 

Claim:  D  is  equivalent  to  an  outermost  derivation  D'  such  that  tail-length(  Z)')  =  0 
and  \D'\  <  \D\. 

Basis.   tail-lengtli(D  )=0.  Trivial. 

Induction.  Suppose  the  claim  holds  for  the  derivation  with  smaller  tail-length  func- 
tion value  than  D.  Suppose  the  first  non-outermost  rewriting  occurs  on  the  redcx 
s.  There  must  be  an  outermost  redex  Sgut  containing  .s  properly.  Since  E  is  non- 
ambiguous,  i;  must  be  a  variable  redex  in  .Sq,,..  Suppose  the  rule  appliable  on  s^^t  is  r 

which  has  variables  Ti Xk  at  the  left  side  and  the  subterms  matched  with  .Tj r^. 

are  Si,...,Sk.  s  is  a  subterm  of  one  of  {s] Sk}.  Suppose  Dtait  is  the  subderivation 

of  D  from  the  rewriting  of  .?  through  the  end.  We  discuss  in  cases  what  rewriting  will 
happen  to  s^ut  or  above  Sout  in  Dtaii-  Let  Dhead  denote  the  remaining  subderivation 
in  D  except  Dtad. 

Case  1.    .A.11  rewritings  to  Sgy^t  or  above  arc  applied  only  on  or  inside  .'^] s^,..    This 

case  is  impossible  since  r  will  be  appliable.  even  in  the  case  that  R  is  non-left -linear. 
Because  identical  terms  will  be  rewritten  to  a  same  term  by  the  confluence  property. 
In  the  following  cases,  suppose  r'  is  the  first  rule  applied  on  a  redex  that  is  not  a 
subterm  of  any  one  of  .Sj 5^.  and  r'  is  applied  to  or  above  Sout  ■ 

Case  2.  r'  is  applied  on  a  proper  subterm  of  Sout.  r'  superposes  r,  contrary  to  the 
condition  that  R  is  non-ambiguous,  so  this  case  is  impossible. 

Case  3.  r'  is  applied  on  Soux-  Before  r'  is  applied,  all  rewritings  to  Soui  must  be  on 
or  inside  .Si.....^^  by  case  2.  Suppose  ,s,  becomes  s^[\  <  i  <  k)  and  Sout  becomes  .s^^^, 
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just  before  r'  is  applied. 

Because  R  is  left-linear,  obviously  t,  matches  s\{\  <  i  <  k)  and  r  can  be  applied 
on  s'^^f.  Since  R  is  non-ambiguous,  only  one  rule  can  be  applied  to  a  term.  Therefore. 
r'  —  r.  . 

We  move  rewriting  by  r  in  front  of  the  rewriting  on  .<;.  D  becoming  D' .  We  suppose 
that  in  D  the  rewriting  on  .^  is  D[i]  and  the  rewriting  on  s^ut  is  D[j]{j  <  j).  Suppose 
Sout  is  rewritten  to  sl^^  by  r  in  D'  and  s'^^^  is  rewritten  to  s^l^  by  r  in  D.  .s^'^,  and  .^^'j^, 

are  the  same  except  the  changes  to  .^i .s^..  Since  R  is  left-linear  and  is  represented 

by  dags  such  that  the  different  occurrences  of  the  same  variable  at  right  sides  of  rules 
point  to  the  same  node,  we  can  assume  the  distinct  variables  at  the  right  side  of  r 

are  x,j , ....  x,^(l  <  zi  <  ...  <  Zj  <  A').  In  D'.  all  .s,, .<,    are  rewritten  to  ^s^  ^[    by 

the  same  rewritings  between  D[i]  and  D[j]  in  D.  .A.ll  other  rewritings  between  D[i] 
and  D[j]  occur  on  the  general  siblings  of  s,-,ut  and  the}-  are  not  affected  by  moving 
of  the  rewriting  on  Sout^  so  they  can  be  applied  in  D' .  Hence,  D'  is  equivalent  D 
and  \D'\  <  \D\.  tail-length(D')  <  tail-length(Z)).  By  induction  hypothesis,  the  claim 
holds  for  D' . 

Case  4.   r'  is  applied  above  Sout-   By  case  2.  before  r'  is  applied,  all  rewriting  to  So^i 

must  be  on  or  inside  .si s^-  If  Sout  were  not  a  variable  redex  of  /''.  r  would  superpose 

r'.  Therefore  Sout  is  a  variable  redex  of  r'.  Suppose  the  term  being  rewritten  by  r' 
is  i.  By  non-ambiguity,  all  proper  subterms  in  f  not  corresponding  to  variables  of 
r'  are  normal  forms.  Suppose  derivation  D^ub  reduces  a  proper  subterm  t'  of  f  to 
its  normal  form  and  one  step  of  Dsub  is  in  Dtaii-  Obviously.  tail-lengtlifD^t,*, )  <  tail- 
length(Z)).  By  induction  hypothesis,  we  can  obtain  an  outermost  derivation  £),.^^, 
equivalent  to  Dsub  and  ID^^tJ  <  \Dsub\-  Since  t'  must  be  a  general  sibling  of  Sout  and 
s,  all  rewritings  to  t'  has  no  effect  to  5.  so  we  can  move  all  steps  of  D'^ub  in  front  of  the 
rewriting  step  of  .s.  and  obtain  a  derivation  equivalent  to  D  with  smaller  tail-length 
function  value.  Using  the  induction  hypothesis  again  for  the  modified  D.  we  can 
prove  the  claim  holds  for  D.  Therefore  we  can  assume  that  in  D.  all  derivation  steps 
used  in  rewriting  all  proper  subterms  in  t  not  corresponding  to  the  variables  in  r'  are 
in  Dhead-  In  this  case,  r'  is  appliable  at  the  beginning  of  -D(a,;-  l^nt  this  is  contrary  to 
the  supposition  that  r  is  an  outermost  redex  at  the  beginning  of  Dtad- 

We  have  enumerated  all  cases.  The  claim  does  hold.  The  theorem  immediately 
follows  the  claim.  D 

Even  outermost  strategy  is  not  able  to  guarantee  to  generate  tlie  optimal  deriva- 
tion. One  of  examples  is  as  follows:  system  f(.T.b)  —>  X.a  -^  h.c  —*  d  and  term 
f(c.a).  Both  c  and  a  are  outermost  redexes.  but  which  one  is  rewritten  first  has 
effect  on  the  length  of  derivation,  c  firs^:  f{c.a)  —^  f{d.a)  —*  /(d.b)  — v  A'  and  a 
first:  f(c.a)  -^  f[c.h)  -^  A'. 

The  variable-delay  strategy,  the  strategy  better  than  outermost  strategy,  is  giv- 
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en  in  [Li90]  for  left-linear  and  variable-fewer  TRSs.  With  dag  representation,  the 
"variable-fewer"  condition  can  be  lifted,  so  that  we  have  a  more  general  result  for 
variable  strategy.  Before  we  go  to  the  formal  description,  we  need  a  lemma. 

Lemma  2  Let  D  he  an  outermost  derivation  of  a  non-ombigtious  TRS  and  t  a  non- 
vanablc  outermost  redex  after  some  steps  of  D.  hi  D.  t  remains  a  non-variable 
outermost  redex  until  it  is  rewritten. 

Proof:  Suppose  after  D[i]  in  D.  Ms  a  non-variable  outermost  redex.  If  /  is  no  longer 
a  non-variable  outermost  redex  after  D[i  +  1].  there  are  only  two  possibilities:  t 
itself  is  rewritten  at  step  D[i  -f  1]  or  a  redex  /'  which  is  al)ove  t  is  rewritten  at  step 
D[i  +  1].  For  the  lemma,  we  only  need  to  discuss  the  latter  case.  Because  Ms  a 
non- variable  redex.  t  is  unifiable  with  a  non-varial)le  subterm  of/',  which  is  contrary 
to  the  condition  that  the  TRS  is  non-ambiguous.  D 

Theorem  3  In  dag  rewriting,  for  a  left-linear  TRS.  any  outermost  derivation  D  has 
an  equivalent  variable- delay  derivation  D'  such  thai  \D'\  <  \D\. 

Proof:  .  Let  i?  be  a  linear  TRS.  D  an  outermost  derivation,  and  tail-leni^thi  D) 
denote  the  length  of  the  subderivation  of  D  from  the  first  non-varia])le-delay  rewriting 
through  the  end. 

We  use  induction  on  tail-length{Z))  to  prove: 

Claim:  D  is  ecjuivalent  to  a  variable-delay  derivation  D'  such  that  tail-length(  D'  )  =  () 
and  \D'\  <  \D\. 

Basis.   tail-var-length(  D  )=0.  Trivial. 

Induction.  Suppose  the  claim  holds  for  the  outermost  derivation  with  smaller  tail- 
length  function  value  than  D.  Suppose  after  D[i]  in  D  there  is  a  non-varialile  out- 
ermost /„o„  and  a  variable  outermost  t,,r,r  and  the  latter  is  rewritten  at  strp  D[i]. 
Before  D[i].  all  derivation  steps  are  applied  by  variable-delay  strategy.  By  Irnuna  "2. 
t„c.n  ^\'ill  be  rewritten  in  D  at  step  D[j]  (i  <  j).  Construct  an  intermediate  derivation 
as  follows. 

D"  :  Move  rewriting  on  /„(,„  in  front  of  the  rewriting  on  /,.ar. 

.A.1!  rewritings  between  D[i]  and  D[j]  in  D  can  1>p  done  between  £)'[/  -4- 1]  and  D'[j] 
in  D' .  because  those  redexes  being  rewritten  V)ftween  the  steps  are  all  outermost 
and  they  camiot  be  inside  or  above  /nor,-  so  the  rewriting  on  /„,,„  does  not  affect 
them.  Therefore  D"  is  ecjuivalent  to  D  and  has  the  same  length.  By  theorem  2.  the 
suljderivation  between  D"[i  +  l]  and  the  end  has  an  ecjuivahMit  outermost  deri\-ati()n 
with  equal  or  smaller  length,  say  it  is  I?,,,,/.  Concatenating  D"[l.i]  with  with  D,.,,i. 
we  obtain  D' .  which  is  equivalent  to  D.  is  outermost,  and  has  smaller  tail-length 
function  value.  By  induction  hyj)othesis.  the  claim  holds  for  D' .  D 
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4.3      Non-Left-Linear  TRSs 

For  term  rewriting,  one  of  the  great  advantages  of  dag  represent, -it ion  is  that  (hfferent 
occurrences  of  a  variable  in  a  term  point  to  the  Scvme  node.  In  the  Uist  svibsection. 
taking  advantage  that  several  occurrences  of  a  variable  at  right  sitle  of  rules  share  one 
node,  we  showed  that  any  derivation  is  optimal  for  non-ambiguous  and  variable-more 
TRSs  and  that  outermost  strategy  and  variable-delay  are  good  candidates  for  efficient 
normal  form  com]:iutation  for  any  non-ambiguous  TRS.  hi  this  sul)s(^ction.  we  will 
take  advantage  that  the  different  occurrences  of  a  variable  at  left  side  of  rules  share 
one  node,  which  will  lead  us  to  new  discoveries  for  non-left -linear  TRSs.  We  have 
noted  that  most  of  research  on  efficient  term  rewriting  before  focuscxl  on  left-linenr 
TRSs. 

Let  rule  /  — >  r  rewrite  ternr  i  and  /  be  a  non-linear  term.  Specifically,  suppose 
variable  x  occurs  in  /  multiple  times.  In  dag  representation,  all  occurrences  of  .r  in  / 
share  one  node.  When  /  matches  a  subterm  t'  of  t.  all  occurrences  of  ,r  in  /  match  a 
same  subterm  of  /'.  Due  to  this  fact,  all  discussions  on  rewriting  by  left-linear  TRSs 
in  the  last  subsection  are  valid  for  rewriting  by  non-left-linear  TRSs.  Let's  see  an 
example.  Most  of  proofs  in  the  last  subsection  used  '"swapping"  of  two  rewritings: 
Let  i?  be  a  non-ambiguous  and  left-linear  TRS.  t  a  redex  and  /'  a  redex  inside  /.  i.e.. 
f'  is  a  projier  subterm  of  /.  By  non-ambiguity  of  R.  i'  must  be  a  variable  redex  of 
/.  Since  R  is  left-linear,  the  \'ariables  at  left  side  of  rules  are  different  and  they  may 
match  different  subterms.  No  matter  when  i'  is  rewritten.  /  is  always  a  redex.  That 
means,  rewriting  on  i'  has  ncj  effect  on  rewriting  on  /.  Hence,  rewritings  on  /  and 
i'  can  be  swapped  in  any  order.  In  dag  rewriting,  all  occurrences  of  a  varialdc  at 
left  side  of  a  rule  match  a  same  subterm.  Rewriting  on  /'  has  no  effect  on  rewriting 
on  t,  so  they  can  be  swapped  in  any  order.  By  comparison,  if  a  non-left-linear  TRS 
is  represented  by  trees,  we  cannot  do  this  "swapping".  For  example,  given  system 
{/(x,x)  — >  .V.  0  — *  h]  and  terin  f(a.a).  t  =  f{a.a)  is  a  redex  and  /'  =  left  son  a  of 
/  is  a  redex  inside  /.  If  f'  is  rewritten  to  b.  we  don't  have  any  redex  of  the  form  /(...) 
furthermore. 

The  advantage  of  dag  representation  for  non-left-linear  TRSs  described  al)ove 
can  be  used  in  all  j^roofs  for  the  non-left-linear  versif)n  of  theorem  1.  tlieorem  2.  and 
theorem  3.  That  is.  the  'ieft-linear"  condition  can  be  lifted.  We  rlon't  rejieaf  the 
proofs  here.   Therefore,  we  ha\'e 

Theorem  4  In  dag  rcwrifivg.  for  a  Jimi-amhiguoit.ti  and  vnrinhlc-jiioTr  TRS.  qivm 
any  term,  a.ny  dcrivatiov  for  that  tcrvi  i.^  optimal. 

Tlieorem  5  In  dag  rewriting,  for  a  non-ambiguoxis  TRS.  any  derivation  D  Iia.'^  an 
equivalent  outermost  derivation  D'  .'<uch  that  \D'\  <  \D\. 

Theorem  6  Iv  dag  rewriting,  for  a  nov- ambiguous  TRS.  any  outermost  derivation 
D  has  an  equivalent  variable -delay  derivation  D'  such  that  \D'\  <  \D\. 
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5      Conclusions 

We  have  designed  a  dag  representation  for  terms  and  term  rewriting  systems,  wliirh 
save  space  and  rewriting  steps  as  most  as  possible  in  the  view  that  redundant  space 
and  rewritings  are  avoided.  The  term  represented  by  a  tree  is  uniquely  mapped 
to  a  dag.  This  simple  corresponding  relationship  between  tree  representation  and 
dag  representation  provides  us  a  clear  base  to  establish  relationship  for  in:iportant 
properties  of  rewriting  system  between  tree  rewriting  and  dag  rewriting,  ^^p  have 
provided  efficient  strategies  for  dag  rewriting.  With  dag  representation,  we  have 
designed  efficient  strategies  for  non-left -linear  rewriting  systems. 

We  have  given  efficiency  analysis  for  each  step  in  rewriting  a  term.  But  more 
complexity  analysis  for  comparing  tree  rewriting  and  dag  rewriting  is  interesting.  In 
general,  the  more  number  of  subterms  are  shared,  the  more  efficient  the  dag  rewriting 
is.  We  may  consider  about  the  average  complexity.  Matching  two  dags  is  efficient. 
How  to  find  a  redex  in  a  dag  term  is  another  interesting  topic.  Since  occurrences  of 
subterms  in  a  term  are  much  fewer  if  the  term  is  represented  by  a  dag.  the  redex 
finding  process  will  be  much  more  efficient. 
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